THE TWO WEIGHT INEQUALITY FOR HILBERT TRANSFORM, CORONAS, AND ENERGY CONDITIONS
Abstract. We consider the two weight problem for the Hilbert transform, namely the question of finding a real-variable characterization of those pairs of weights for which the Hilbert transform acts boundedly on $L^2$ of the weights. Such a characterization is known subject to certain side conditions. We give a new proof, simpler in many details, of the best such result. In addition, we analyze underlying assumptions in the proof, especially in terms of two alternate side conditions. A new characterization in the case of one doubling weight is given.
1. Introduction

A weight is a non-negative Borel measure. We are interested in the two weight question for the Hilbert transform: for two weights $(\sigma, w)$, characterize the inequality

(1.1)  $\|H(\sigma f)\|_{L^2(w)} \le \mathcal B \|f\|_{L^2(\sigma)}$.

Here, the inequality is understood in the sense that there is a uniform bound on the operator norm of a standard truncation of the singular integral kernel. Throughout, we will write $H_\sigma f := H(\sigma f)$, and understand at all times that some truncation is in place. The inequality above is in its self-dual formulation: interchange the roles of $w$ and $\sigma$ to get the dual inequality. We are also focused on $L^2$ inequalities, so throughout we use the abbreviation $\|f\|_\sigma := \|f\|_{L^2(\sigma)}$. The following conjecture, due to Nazarov-Treil-Volberg [12], has been the focus of attention.

1.2. Conjecture. For a pair of weights $(w, \sigma)$ we have the inequality (1.1) if and only if these three constants are finite.

(1.3)  $\mathcal A_2 := \sup_{x \in \mathbb R} \sup_{t > 0} Pw(x,t)\, P\sigma(x,t)$,

(1.4)  $\mathcal H^2 := \sup_I \sigma(I)^{-1} \int_I |H(\sigma \mathbf 1_I)|^2 \, w(dx)$,

(1.5)  $\mathcal H_*^2 := \sup_I w(I)^{-1} \int_I |H(w \mathbf 1_I)|^2 \, \sigma(dx)$,

where in the first line, $Pw(x,t)$ denotes the Poisson extension of $w$ to the upper half plane. In particular, the first line is an extension of the classical $A_2$ condition, and is referred to herein as the $A_2$ condition. The next two conditions are dual to one another, and are referred to as the testing conditions.

We will keep track of certain constants, like the three constants in the Conjecture above. Many of these will come in dual pairs, namely with the roles of $w$ and $\sigma$ reversed. An asterisk subscript will denote the dual constant, obtained through exchanging the roles of the two weights. The exact form of the Poisson integral is not important for us, and throughout we will use the following form of it. For a weight $\sigma$ and an interval $I$, we set

$P(\sigma, I) := \int_{\mathbb R} \frac{|I|}{(|I| + \operatorname{dist}(x, I))^2} \, \sigma(dx)$.

This is the same, up to constants, as evaluating the usual Poisson extension of $\sigma$ at the point $(c, |I|)$, where $c$ is any point of $I$.
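As a concrete illustration, the quantity $P(\sigma, I)$ can be evaluated directly when $\sigma$ is a finite sum of point masses. The sketch below assumes that setting; the function names and the representation of $\sigma$ as a list of (position, mass) pairs are choices made for this example and are not notation from the text.

```python
def dist_to_interval(x, a, b):
    """Distance from the point x to the closed interval [a, b]."""
    if x < a:
        return a - x
    if x > b:
        return x - b
    return 0.0


def poisson_average(atoms, a, b):
    """P(sigma, I) for I = [a, b], with sigma a finite sum of point masses.

    `atoms` is a list of (position, mass) pairs (an assumption of this sketch).
    Each atom contributes mass * |I| / (|I| + dist(x, I))^2.
    """
    length = b - a
    return sum(m * length / (length + dist_to_interval(x, a, b)) ** 2
               for x, m in atoms)
```

Mass inside $I$ is counted with weight $1/|I|$, and mass at distance $d$ from $I$ is damped by the factor $|I|/(|I|+d)^2$, which is the decay of the Poisson kernel at height $|I|$.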

To date, the Conjecture above has only been verified for pairs of weights which satisfy side conditions, which help control certain degeneracies in the weights $\sigma$ and $w$. These side conditions are inspired by the Pivotal Conditions of [7], and were expanded and refined in [5], using the notion of energy. Our purposes are as follows. (1) We will give a notably simpler proof of the best known current estimates. (2) We will analyze the proof strategy, introducing new side conditions sufficient for the two-weight estimate. These new side conditions are themselves, in a sense to be made precise in §3, a consequence of the correctness of the proof strategy. (3) We point out in Question 3.3 that it is not known if the proof strategy applies to all pairs of weights which satisfy the two weight inequality. A new characterization when just one weight is doubling will follow from this analysis.

We define for an interval $I$ the energy of $w$ over $I$ to be

$E(w, I)^2 := \frac{1}{w(I)^2} \int_I \int_I \frac{|x - x'|^2}{|I|^2} \, w(dx)\, w(dx')$.

Assuming that $|I| = 1$ and $w(I) = 1$, this is two times the square of the distance, in the $L^2(w \mathbf 1_I)$ metric, of the function $x \mathbf 1_I$ to the linear space of constants. The energy constant of a pair of weights $(\sigma, w)$ is the smallest constant $\mathcal E$ for which the following inequality holds. For all intervals $I_0$ and all partitions $\{I_j : j \ge 1\}$ of $I_0$ we have

(1.6)  $\sum_{j \ge 1} P(\sigma \cdot I_0, I_j)^2 E(w, I_j)^2 w(I_j) \le \mathcal E^2 \sigma(I_0)$.

Here, inside the Poisson integral, we are identifying the interval $I_0$ with its indicator function, which we will do throughout, as this will be a convenience in the heart of the proof.
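The remark that the energy is twice a squared distance to constants can be checked numerically. The following is a minimal sketch, assuming $w$ is a finite sum of point masses given as (position, mass) pairs; the function names are hypothetical. It computes $E(w,I)^2$ from the double integral and compares it with $2\operatorname{Var}_w(x)/|I|^2$, where the variance is taken with respect to $w$ normalized on $I$.

```python
def energy_squared(atoms, a, b):
    """E(w, I)^2 = w(I)^-2 * sum over pairs of atoms in I of
    m * m' * (x - x')^2 / |I|^2, for I = [a, b]."""
    inside = [(x, m) for x, m in atoms if a <= x <= b]
    total = sum(m for _, m in inside)
    if total == 0:
        return 0.0
    length = b - a
    double_sum = sum(m1 * m2 * (x1 - x2) ** 2
                     for x1, m1 in inside for x2, m2 in inside)
    return double_sum / (total ** 2 * length ** 2)


def variance(atoms, a, b):
    """Variance of x with respect to w restricted to [a, b] and normalized."""
    inside = [(x, m) for x, m in atoms if a <= x <= b]
    total = sum(m for _, m in inside)
    mean = sum(m * x for x, m in inside) / total
    return sum(m * (x - mean) ** 2 for x, m in inside) / total
```

In particular, a weight that is a single point mass on $I$ has zero energy there, which is the kind of degeneracy the energy condition is designed to detect.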

A fundamental observation is that the energy constant is finite if the $A_2$ constant and the testing conditions (1.4) and (1.5) hold. Namely, it was proved in [5] that we have $\mathcal E \lesssim \mathcal A_2 + \mathcal H$. This depends upon the specific character of the $1/y$ kernel; its modification for other relevant singular integrals is not nearly as simple.

We turn to the side conditions we need for our Theorem. Fix a choice of $0 < \epsilon < \tfrac12$ and an integer $r > 2$. We say that a pair of intervals $(I, J)$ is $(\epsilon, r)$-good if, whenever $J \subset I$ and $|J| < 2^{-r}|I|$, it follows that $\operatorname{dist}(J, \partial I) \ge |I|^{1-\epsilon}|J|^{\epsilon}$.

1.7. Definition. The Dini energy constant of a pair of weights $(\sigma, w)$ is the smallest constant $\mathcal V$ for which the following inequality holds: There is a decreasing non-negative sequence $\psi(s)$ with $\sum_s \psi(s) = 1$, so that for all integers $s$

(1.8)  $\psi(s)^{-1} \sum_{j,k \ge 1} P(\sigma \cdot (I_0 - I_j), I_{j,k})^2 E(w, I_{j,k})^2 w(I_{j,k}) \le \mathcal V^2 \sigma(I_0)$.

In this inequality, we have these conditions.

(1) $I_0$ is an interval and $\{I_j : j \ge 1\}$ is a partition of $I_0$.

(2) We have secondary partitions of $I_j$ into intervals $\{I_{j,k} : k \ge 1\}$, so that the pair of intervals $(I_j, I_{j,k})$ is $(\epsilon, r)$-good for all $j, k \ge 1$.

(3) We have $|I_{j,k}| \le 2^{-s}|I_j|$ for all $j, k \ge 1$.

Note that here, it is certainly required that we consider the Poisson integral of $\sigma$ restricted to the complement of $I_j$, else we could not expect to get the required decay in $s$ to make the supremum finite.

This is very close to the side condition considered in [5], and is weaker than the pivotal condition 
of [7]. Namely, there is a pair of weights which fail one direction of the Pivotal Condition, but 
satisfy both directions of the side condition above, for i|)(s) ~ 2^^^^^, and the Hilbert transform 
is bounded for this pair of weights. 

1.9. Theorem. Let $w, \sigma$ be two weights which do not share any common point mass, and for some $0 < \epsilon < 1$ and integer $r$, have finite Dini energy constant $\mathcal V$ and finite dual Dini energy constant $\mathcal V_*$. Then Conjecture 1.2 holds. Namely, we have the two weight inequality (1.1) if and only if the $A_2$ condition (1.3) and the two testing conditions (1.4) and (1.5) hold.

This theorem is essentially contained in [5], but the current proof contains many simplifications. Basic to the proofs are corona decompositions. We introduce herein a Calderón-Zygmund corona, whose use precludes the need for nuanced Carleson measure estimates. We still need a sophisticated corona decomposition modeled on one in [7], but in the current formulation we can again avoid appeals to Carleson measure estimates. Prior arguments required a number of such estimates.

One of us initiated the study of two weight inequalities for the maximal function [10] and fractional integrals [11]. Cotlar and Sadosky have established two weight variants of the Helson-Szegő theorem [2], providing a complex analytic solution to the two weight problem. The dyadic variant of the Nazarov-Treil-Volberg conjecture is proved in [8]. Two weight inequalities for maximal truncations of singular integrals are studied by a completely different method in [4]. This paper represents, in some sense, a unification of these two lines of approach. The two weight problem for the Hilbert transform is closely related to a number of subjects, including embedding inequalities for model space [9] and de Branges space [1]; interpolating sequences for Paley-Wiener space [6]; and spectral theory for perturbed operators [3].



2. Dyadic Grids and Haar Functions 

Dyadic Grids. A collection of intervals $\mathcal G$ is a grid if for all $G, G' \in \mathcal G$, we have $G \cap G' \in \{\emptyset, G, G'\}$. By a dyadic grid we mean a grid $\mathcal D$ of intervals of $\mathbb R$ such that for each interval $I \in \mathcal D$, the subcollection $\{I' \in \mathcal D : |I'| = |I|\}$ partitions $\mathbb R$, aside from endpoints of the intervals. In addition, the left and right halves of $I$, denoted by $I_\pm$, are also in $\mathcal D$.

For $I \in \mathcal D$, the left and right halves $I_\pm$ are referred to as the children of $I$. We denote by $\pi_{\mathcal D}(I)$ the unique interval in $\mathcal D$ having $I$ as a child, and we refer to $\pi_{\mathcal D}(I)$ as the $\mathcal D$-parent of $I$.

There is no unique choice of $\mathcal D$. To accommodate the notion of an interval being $(\epsilon, r)$-good, one must make a random selection of grids, but we have nothing to contribute to this portion of the proof. We refer the reader to [5,7] for more details on this point.
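For readers who want to experiment, the standard dyadic grid gives a simple model of these definitions. The sketch below is an illustration under that assumption only: an interval $[m 2^{-k}, (m+1) 2^{-k})$ is encoded by the hypothetical pair `(k, m)`, and the random shifts of the grid mentioned above are not modeled.

```python
from fractions import Fraction


def endpoints(k, m):
    """Endpoints of the dyadic interval [m * 2^-k, (m + 1) * 2^-k)."""
    h = Fraction(2) ** (-k)
    return m * h, (m + 1) * h


def children(k, m):
    """Left and right halves I_- and I_+ of the interval (k, m)."""
    return (k + 1, 2 * m), (k + 1, 2 * m + 1)


def parent(k, m):
    """The unique interval of the grid having (k, m) as a child."""
    return (k - 1, m // 2)


def nested_or_disjoint(g1, g2):
    """Grid property: the intersection of two intervals is empty, g1, or g2."""
    a1, b1 = endpoints(*g1)
    a2, b2 = endpoints(*g2)
    lo, hi = max(a1, a2), min(b1, b2)
    return lo >= hi or (lo, hi) == (a1, b1) or (lo, hi) == (a2, b2)
```

Exact rational endpoints are used so that the nesting test is not affected by rounding.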

Haar Functions. Let $\sigma$ be a weight on $\mathbb R$, one that does not assign positive mass to any endpoint of a dyadic grid $\mathcal D$. We define the Haar functions associated to $\sigma$ as follows.

$h^\sigma_I := \sqrt{\frac{\sigma(I_-)\sigma(I_+)}{\sigma(I)}} \left( \frac{I_-}{\sigma(I_-)} - \frac{I_+}{\sigma(I_+)} \right)$.

In this definition, we are identifying an interval with its indicator function, and we will do so throughout the remainder of the paper. This is an $L^2(\sigma)$-normalized function, and has $\sigma$-integral zero. For any dyadic interval $I^0$, it holds that $\{\sigma(I^0)^{-1/2} I^0\} \cup \{h^\sigma_I : I \in \mathcal D,\ I \subset I^0\}$ is an orthogonal basis for $L^2(I^0 \sigma)$.
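The normalization and cancellation of the weighted Haar function are easy to verify in a discrete model. The sketch below assumes $\sigma$ is given by positive masses on an even number of equal cells making up $I$, with $I_-$ the first half and $I_+$ the second half. The function name and the sign convention (positive on $I_-$) are assumptions of this example; the two checks in the usage note do not depend on the sign.

```python
import math


def haar_values(sigma, lo, hi):
    """Values of the sigma-weighted Haar function of I on the cells lo..hi-1.

    sigma is a list of positive cell masses; I_- is cells lo..mid-1 and
    I_+ is cells mid..hi-1. The function is constant on each half.
    """
    mid = (lo + hi) // 2
    left = sum(sigma[lo:mid])    # sigma(I_-)
    right = sum(sigma[mid:hi])   # sigma(I_+)
    c = math.sqrt(left * right / (left + right))
    return [c / left] * (mid - lo) + [-c / right] * (hi - mid)
```

With `h = haar_values(sigma, lo, hi)`, the sum of `h[i] * sigma[lo + i]` is zero and the sum of `h[i] ** 2 * sigma[lo + i]` is one, up to rounding.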

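We will use the notation

$\Delta^\sigma_I f := \langle f, h^\sigma_I \rangle_\sigma h^\sigma_I = I_+ E^\sigma_{I_+} f + I_- E^\sigma_{I_-} f - I E^\sigma_I f$, where $E^\sigma_I f := \sigma(I)^{-1} \int_I f \, \sigma(dx)$.

The second equality is the familiar martingale difference equality, and so we will refer to $\Delta^\sigma_I f$ as a martingale difference. It implies the familiar telescoping identity $E^\sigma_J f = \sum_{I : I \supsetneq J} E^\sigma_J \Delta^\sigma_I f$. Finally, we will need the estimate below, which follows immediately from Cauchy-Schwarz.

(2.1)  $|E^\sigma_{I_\pm} h^\sigma_I| \le \sigma(I_\pm)^{-1/2}$.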
Good-Bad Decomposition. With a choice of dyadic grid $\mathcal D$ understood, we then slightly change the definition of $(\epsilon, r)$-good. We say that $J \in \mathcal D$ is $(\epsilon, r)$-good if and only if for all intervals $I \in \mathcal D$ with $|I| \ge 2^{r+1}|J|$, we have that the distance from $J$ to the boundary of either child of $I$ is at least $|J|^{\epsilon}|I|^{1-\epsilon}$.

For $f \in L^2(\sigma)$ we set $P^\sigma_{\mathrm{good}} f := \sum_{I \in \mathcal D :\ I \text{ is } (\epsilon, r)\text{-good}} \Delta^\sigma_I f$. The projection $P^w_{\mathrm{good}} \phi$ is defined similarly.

Important elements of the suppressed construction of random grids [5,7] are that

(1) It suffices to consider a single dyadic grid $\mathcal D$, but we will sometimes write $\mathcal D^\sigma$ and $\mathcal D^w$ to emphasize the role of the two weights.

(2) For any fixed $0 < \epsilon < \tfrac12$, we can choose the integer $r$ sufficiently large so that it suffices to consider $f$ such that $f = P^\sigma_{\mathrm{good}} f$, and likewise for $\phi \in L^2(w)$.

The last property is, at some moments, essential. We suppress it in the notation, while taking care to emphasize in the text those places in which we appeal to the property of being good.
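The dyadic notion of goodness can be tested mechanically in the standard grid. The following is a minimal sketch under several assumptions of this example: intervals are encoded as `(k, m)` for $[m 2^{-k}, (m+1) 2^{-k})$ with $k \ge 0$, only ancestors of $J$ down to a hypothetical coarsest level `top_level` are examined, and only the ancestors of $J$ are checked, since among intervals of a given length the child boundaries nearest to $J$ are those of the ancestor containing $J$.

```python
from fractions import Fraction


def interval(k, m):
    """Exact endpoints of the dyadic interval [m * 2^-k, (m + 1) * 2^-k), k >= 0."""
    h = Fraction(1, 2 ** k)
    return m * h, (m + 1) * h


def is_good(k, m, eps, r, top_level=0):
    """Check that J = (k, m) is (eps, r)-good against its dyadic ancestors.

    For every ancestor I with |I| >= 2^(r+1) |J| (levels top_level..k-r-1),
    the distance from J to the endpoints and midpoint of I must be at least
    |J|^eps * |I|^(1 - eps).
    """
    a, b = interval(k, m)
    len_j = float(b - a)
    for j in range(top_level, k - r):
        lo, hi = interval(j, m // 2 ** (k - j))
        mid = (lo + hi) / 2
        dist = min(min(abs(p - a), abs(p - b)) for p in (lo, mid, hi))
        if float(dist) < len_j ** eps * float(hi - lo) ** (1 - eps):
            return False
    return True
```

For example, with `eps = 0.5` and `r = 5`, the interval `(8, 85)` near the point 1/3 passes the test, while `(8, 0)`, which shares an endpoint with every ancestor, does not.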

3. Analysis of the Splitting Assumption 

Our principal concern is the bilinear form $B(f, \phi) := \langle H_\sigma f, \phi \rangle_w$; let $\mathcal B$ be the best constant in the inequality $|B(f, \phi)| \le \mathcal B \|f\|_\sigma \|\phi\|_w$.

We define two more forms here. Throughout the paper, by $J \Subset I$ we mean that $I, J$ are dyadic intervals in a fixed dyadic grid, and $J \subset I$ with $|J| < 2^{-r}|I|$, with $r$ the fixed integer in the $(\epsilon, r)$-good property. Define

$B_\Subset(f, \phi) := \sum_{I \in \mathcal D} \sum_{J : J \Subset I} E^\sigma_{I_J} \Delta^\sigma_I f \cdot \langle H_\sigma I_J, \Delta^w_J \phi \rangle_w$,

where $I_J$ denotes the child of $I$ that contains $J$. And, as mentioned in the previous section, we identify an interval and its indicator function. Denote by $B_\Supset(f, \phi)$ the dual bilinear form obtained by interchanging $w$ and $\sigma$. See Figure 1 for a diagram illustrating the definition of these two forms. Let $\mathcal B_\Subset$ be the best constant in the inequality

$|B_\Subset(f, \phi)| \le \mathcal B_\Subset \|f\|_\sigma \|\phi\|_w$,

and $\mathcal B_\Supset$ be the best constant in the inequality for the dual bilinear form.

In order to state our first main result, we need one more constant. Let $\mathcal W$ be the best constant in the inequality

(3.1)  $|\langle H_\sigma I, J \rangle_w| \le \mathcal W \sigma(I)^{1/2} w(J)^{1/2}$,

where $I$ and $J$ are intervals with $2^{-r}|J| \le |I| \le 2^{r}|J|$. Recall that the integer $r$ is fixed. It is known that $\mathcal W \lesssim \mathcal A_2 + \min\{\mathcal H, \mathcal H_*\}$.

3.2. Theorem. Assume that the pair of weights satisfy the $A_2$ bound, and the two interval testing conditions (1.4) and (1.5). Then, assuming $f = P^\sigma_{\mathrm{good}} f$, and likewise for $\phi$, it holds that

$|B(f, \phi) - \{B_\Subset(f, \phi) + B_\Supset(f, \phi)\}| \lesssim \{\mathcal A_2 + \mathcal W\} \|f\|_\sigma \|\phi\|_w$.

Figure 1. A schematic diagram for the two forms $B_\Subset$ and $B_\Supset$. The dashed lines around the diagonal indicate that the terms associated with $2^{-r}|J| \le |I| \le 2^{r}|J|$ are treated in Theorem 3.2.

That is, the boundedness of $H$ is equivalent to that of the sum $B_\Subset + B_\Supset$. The remainder of the sufficiency proof for the main theorem is based upon the assumption that $B_\Subset(f, g)$ and $B_\Supset(f, g)$ are bounded independently of each other. It is commonplace in classical settings that this assumption holds.¹ This brings up the following

3.3. Question. Let $w, \sigma$ be a pair of weights. Does it hold that $\mathcal B_\Subset + \mathcal B_\Supset \lesssim \mathcal B$?

Without an answer to this question, we cannot be sure that the approach to the two weight 
question used in this paper, and in [5,7] can even succeed. Currently, there is no other approach 
to this question. 

We introduce two new side conditions, more general, and more complicated, than the Dini 
condition; these conditions are phrased in terms of a dyadic grid, which is after all not fixed. 
The purpose in phrasing them is to provide precise description of those objects which the side 
conditions control. 

3.4. Notation. For a subset $\mathcal F$ of the dyadic grid $\mathcal D$, it is convenient to refer to $\mathcal F$ as a sub-tree of the dyadic grid, and it is useful to think of moving up or down the $\mathcal F$-tree, moving by inclusion. For dyadic $I \in \mathcal D$, we set $\pi_{\mathcal F} I$, the $\mathcal F$-parent of $I$, to be the minimal element of $\mathcal F$ that contains $I$. We set $\pi^1_{\mathcal F} I = \pi_{\mathcal F} I$, and inductively define $\pi^{t+1}_{\mathcal F} I$ to be the minimal element of $\mathcal F$ that strictly contains $\pi^t_{\mathcal F} I$. This has the consequence that if $F \in \mathcal F$, then $\pi_{\mathcal F} F = F$. We write $\operatorname{Child}_{\mathcal F}(F)$ for the maximal elements of $\mathcal F$ which are strictly contained in $F$, and call them the $\mathcal F$-children of $F$.

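3.5. Definition. Given an interval $I^0$ we set $\mathcal F(I^0)$ to be the maximal dyadic subintervals $F$ such that $E^\sigma_F |f| > 4 E^\sigma_{I^0} |f|$. We set $\mathcal F_0 = \{I^0\}$, and inductively set

$\mathcal F_{j+1} := \bigcup_{F \in \mathcal F_j} \mathcal F(F)$.

Then, the collection of $f$-stopping intervals is $\mathcal F := \bigcup_{j \ge 0} \mathcal F_j$.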

¹ In various T1 theorems, there are canonical choices of paraproducts, which are bounded by the assumptions of the T1 theorem, whence they are freely added and subtracted in the proof. In the current setting, there is no canonical choice of paraproducts.
A basic fact, a consequence of the universal maximal function estimate, is

(3.6)  $\sum_{F \in \mathcal F} \gamma(F)^2 \sigma(F) \lesssim \|f\|_\sigma^2$,  $\gamma(F) := E^\sigma_F |f|$.

This is referred to as the quasi-orthogonality condition.
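The stopping construction and the quasi-orthogonality sum can be explored in a finite model. The sketch below makes the following assumptions, none of which come from the text: the top interval is split into $2^n$ equal cells, $\sigma$ is given by positive cell masses, $f$ is constant on cells, and dyadic intervals are index ranges `(lo, hi)`. All function names are hypothetical.

```python
def average(f, sigma, lo, hi):
    """sigma-average of |f| over the cells lo..hi-1."""
    mass = sum(sigma[lo:hi])
    return sum(abs(f[i]) * sigma[i] for i in range(lo, hi)) / mass


def stopping_children(f, sigma, lo, hi):
    """Maximal dyadic proper subintervals F of (lo, hi) whose average of |f|
    exceeds four times the average over (lo, hi)."""
    threshold = 4 * average(f, sigma, lo, hi)
    found = []

    def search(a, b):
        if (a, b) != (lo, hi) and average(f, sigma, a, b) > threshold:
            found.append((a, b))  # maximal: do not look inside
            return
        if b - a > 1:
            mid = (a + b) // 2
            search(a, mid)
            search(mid, b)

    search(lo, hi)
    return found


def stopping_intervals(f, sigma):
    """All f-stopping intervals, generation by generation, starting from the top."""
    top = (0, len(f))
    collection, frontier = [top], [top]
    while frontier:
        frontier = [child for (a, b) in frontier
                    for child in stopping_children(f, sigma, a, b)]
        collection.extend(frontier)
    return collection


def quasi_orthogonality_sum(f, sigma):
    """Sum over stopping intervals F of gamma(F)^2 * sigma(F)."""
    return sum(average(f, sigma, a, b) ** 2 * sum(sigma[a:b])
               for (a, b) in stopping_intervals(f, sigma))
```

For `f = [0, 0, 0, 0, 0, 0, 0, 8]` and unit masses, the stopping intervals are the top interval and the last cell, and the sum above equals 72, to be compared with $\|f\|_\sigma^2 = 64$.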

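We take $f$ non-negative and supported on an interval $I^0$, and $f$-stopping intervals $\mathcal F$ as above. Let $\{g_F : F \in \mathcal F\}$ be a collection of functions in $L^2(w)$ so that for each $F$,

(1) $g_F$ is supported on $F$ and constant on each $F' \in \operatorname{Child}_{\mathcal F}(F)$;

(2) letting $\mathcal J^*(F)$ be the maximal intervals $J^*$ such that $J^* \Subset F$, $J^*$ is $(\epsilon, r)$-good, and $\pi_{\mathcal F} J^* = F$, we have $E^w_{J^*} g_F = 0$ for each $J^* \in \mathcal J^*(F)$.

We say that $g_F$ is $\mathcal F$-adapted to $F$. Let $\mathbf F$ be the smallest constant in the inequality below, holding for all non-negative $f \in L^2(\sigma)$, and collections $\{g_F\}$ as just described.

$\sum_{F \in \mathcal F} \sum_{J^* \in \mathcal J^*(F)} P(f\sigma \cdot (\mathbb R - F), J^*) \left\langle \frac{x}{|J^*|}, g_F J^* \right\rangle_w \le \mathbf F \|f\|_\sigma \Bigl[ \sum_{F \in \mathcal F} \|g_F\|_w^2 \Bigr]^{1/2}$.

We refer to this as the functional energy condition. Taking $\mathcal F$ to be a partition of an interval $I_0$, and $f = I_0$, we can recover the energy condition (1.6). We denote by $\mathbf F_*$ the constant in the dual condition, with the roles of $w$ and $\sigma$ reversed.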

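The second condition is as follows. We write $f \in BF_{\mathcal F}(F)$, and say that $f$ is of bounded fluctuation, if (i) $f$ is supported on $F$, (ii) $f$ is constant on each $F' \in \operatorname{Child}_{\mathcal F}(F)$, and (iii) for each dyadic interval $I \subset F$ which is not contained in some $F' \in \operatorname{Child}_{\mathcal F}(F)$, we have $E^\sigma_I |f| \le 1$. We then denote by $\mathbf{BF}$ the best constant in the inequality

(3.7)  $\Bigl| \sum_{I : \pi_{\mathcal F} I = F} \ \sum_{J : J \Subset I,\ \pi_{\mathcal F} J = F} E^\sigma_{I_J} \Delta^\sigma_I f \cdot \langle H_\sigma I_J, \Delta^w_J g \rangle_w \Bigr| \le \mathbf{BF} \{ \sigma(F)^{1/2} + \|f\|_\sigma \} \|g\|_w$,

where $f \in BF_{\mathcal F}(F)$, and $g$ is $\mathcal F$-adapted to $F$. One must note that the two terms $\sigma(F)^{1/2}$ and $\|f\|_\sigma$ on the right above are in general incomparable. We refer to this as the bounded fluctuation condition.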

This condition is a consequence of the boundedness of the form $B_\Subset$, a fact which is not hard, and is proved below.² The role of the constant one in the inequalities $E^\sigma_I |f| \le 1$ is immaterial. It can be replaced by any fixed constant. Indeed, if the measure $\sigma$ is doubling, we could replace 1 by a constant depending only on the doubling constant; then the bounded fluctuation condition reduces to the function being in $L^\infty$.

The following Theorem summarizes much of the content of this paper. 

3.8. Theorem. The following inequalities and their duals hold, for any pair of weights $w, \sigma$ which do not share a common point mass.

(3.9)  $\mathcal B_\Subset \lesssim \mathcal H + \mathbf F + \mathbf{BF}$,



² But it is not known to us that the bounded fluctuation condition is a consequence of the norm boundedness of the Hilbert transform.






M.T. LACEY, E.T. SAWYER, C.-Y. SHEN, AND I. URIARTE-TUERO 



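(3.10)  $\mathbf F \lesssim \mathcal V$, and $\mathbf{BF} \lesssim \mathcal H + \mathcal V$,

(3.11)  $\mathbf F + \mathbf{BF} \lesssim \sqrt{\mathcal A_2} + \mathcal W + \mathcal B_\Subset$.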

In particular, Theorem 1.9 is a corollary to the first two inequalities above, and their duals. The interest in (3.11) is that it shows that the new side conditions, of functional energy and bounded fluctuation, are implications of the proof strategy, namely the assumption that the bilinear form $B_\Subset(f, \phi)$ is bounded. Concerning (3.10), the side condition controls the functional energy inequality by a straightforward argument, but the control of the bounded fluctuation term is a deep argument, given in §6 and initiated in [7].

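3.12. Question. For a pair of weights $(w, \sigma)$, do any of these inequalities hold?

$\mathbf F \lesssim \mathcal B$,

$\mathbf F \lesssim \sqrt{\mathcal A_2} + \mathcal H$,

$\mathbf{BF} \lesssim \mathcal B$,

$\mathbf{BF} \lesssim \sqrt{\mathcal A_2} + \mathcal H$.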

Note that the condition of functional energy is only about non-negative f, and the 'energy' of 
the weight w. It is arguably an acceptable hypothesis to add to Conjecture 1.2; unfortunately, 
neither functional energy nor bounded fluctuation conditions admit an intrinsic formulation. The 
inequality on bounded fluctuation goes to the heart of the conjecture. 

Finally, we indicate a new characterization of the two weight problem when just one weight is 
doubling. This should be compared with the results of [4], which address maximal truncations, 
and also contrasts with a characterization in [7] when both weights are doubling. 

3.13. Theorem. Let $(w, \sigma)$ be a pair of weights with $\sigma$ doubling. Then, the two weight inequality (1.1) holds if and only if these constants are finite.

$\mathcal A_2,\ \mathcal H,\ \mathcal H_*,\ \mathbf F,\ \mathbf{BF} < \infty$.

Proof. As $\sigma$ is doubling, there is a constant $c$ so that for any interval $I$, and any subinterval $I'$ of $I$ of comparable length, it holds that $\sigma(I') \ge c\,\sigma(I)$. From this, it follows that $E(\sigma, I) \ge c/4$. Namely, the energy of any interval is strictly bounded away from zero. Assuming the finiteness of $\mathcal A_2$ and $\mathcal H_*$, as we may do in both directions of the argument, one may easily verify that the dual Dini condition holds, that is, $\mathcal V_* < \infty$. (In fact, the pivotal condition of Nazarov-Treil-Volberg holds, as follows from the energy condition (1.6), which is necessary from $\mathcal A_2$ and $\mathcal H$.)

Assuming that the Hilbert transform is bounded, we necessarily have the finiteness of the $A_2$ and testing constants. Therefore, the dual bilinear form is bounded, $\mathcal B_\Supset < \infty$, hence $\mathcal B_\Subset$ is also finite, bounding $\mathbf F$ and $\mathbf{BF}$, as claimed.

In the reverse direction, the finiteness of $\mathcal A_2$, $\mathcal H_*$, and $\mathcal V_*$ implies the boundedness of $B_\Supset$, and the additional assumptions on functional energy $\mathbf F$ and bounded fluctuation $\mathbf{BF}$ imply the boundedness of $B_\Subset$, hence the Hilbert transform is bounded. □
4. The Splitting of the Operator

We expand the full bilinear form $B(f, \phi) := \langle H_\sigma f, \phi \rangle_w$ according to the weighted Haar basis. For the proof, we will take some (large) interval $I^0$, and assume that $f$ and $\phi$ are supported on $I^0$. Note that by the testing hypothesis,

$|E^\sigma_{I^0} f \cdot \langle H_\sigma(I^0), \phi \rangle_w| \le \mathcal H\, |E^\sigma_{I^0} f|\, \sigma(I^0)^{1/2} \|\phi\|_w$.

The dual inequality also holds, so we are free to assume that $f$ and $\phi$ have respective means zero, and hence are in the closed linear span of the (good) Haar functions.

In the first generation, there are three terms, which are largely the 'below diagonal', 'diagonal', and 'above diagonal' parts.

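$\langle H_\sigma f, \phi \rangle_w = B_{1,1}(f, \phi) + B_{1,2}(f, \phi) + B_{1,3}(f, \phi)$,

$B_{s,t}(f, \phi) := \sum_{(I,J) \in \mathcal P_{s,t}} \langle H_\sigma \Delta^\sigma_I f, \Delta^w_J \phi \rangle_w$,

(4.1)  $\mathcal P_{1,2} := \{(I, J) : 2^{-r}|I| \le |J| \le 2^{r}|I|\}$,
       $\mathcal P_{1,3} := \{(I, J) : |J| < 2^{-r}|I|\}$.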

The term $B_{1,1}$ is dual to $B_{1,3}$, so we do not explicitly define it here, as we will concentrate on $B_{1,3}$. The diagonal term is straightforward to control, and in §8.1, we will show

(4.2)  $|B_{1,2}(f, \phi)| \lesssim (\mathcal A_2 + \mathcal W) \|f\|_\sigma \|\phi\|_w$.

We shall follow this pattern of postponing certain estimates that are 'routine' till a later section, preferring to pass to the more delicate parts of the decomposition, which will always have the larger second indices.

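We concern ourselves with the term $B_{1,3}$ defined in (4.1). We right away split it into a 'far away', a 'local', and an 'inside' part, defined as follows. Set

(4.3)  $B_{1,3} = B_{2,1} + B_{2,2} + B_{2,3}$,

(4.4)  $\mathcal P_{2,1} := \{(I, J) \in \mathcal P_{1,3} : 3I \cap J = \emptyset\}$,
       $\mathcal P_{2,2} := \{(I, J) \in \mathcal P_{1,3} : J \subset 3I \setminus I\}$,
       $\mathcal P_{2,3} := \{(I, J) \in \mathcal P_{1,3} : J \Subset I\}$.

In §8.2 and §8.3, we will show that the first two terms are also controlled by the $A_2$ constant.

(4.5)  $|B_{2,1}(f, \phi)| + |B_{2,2}(f, \phi)| \lesssim \mathcal A_2 \|f\|_\sigma \|\phi\|_w$.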

Concerning the term $B_{2,3}$, we will make this further decomposition. For the pairs of intervals $(I, J)$ in question, we have $J \Subset I$. Recall that $I_J$ is the child of $I$ that contains $J$. Now, the argument of the Hilbert transform is $\Delta^\sigma_I f$, which is constant on the two children of $I$, namely $I_J$ and $I \setminus I_J$. This permits us to write

$B_{2,3} = B_{3,1} + B_{3,2}$,

(4.6)  $B_{3,1}(f, \phi) := \sum_{(I,J) \in \mathcal P_{2,3}} E^\sigma_{I - I_J} \Delta^\sigma_I f \cdot \langle H_\sigma(I - I_J), \Delta^w_J \phi \rangle_w$.

We will show in §8.4 that we have

(4.7)  $|B_{3,1}(f, \phi)| \lesssim \mathcal A_2 \|f\|_\sigma \|\phi\|_w$.

The bilinear form $B_{3,2}$ is the form $B_\Subset$ of §3, and this is the notation that we will use below. Our considerations to this point, together with their duals, complete the proof of Theorem 3.2.

5. The Calderón-Zygmund Corona

This section will be devoted to a proof of the inequality (3.9), namely that the side conditions of functional energy and bounded fluctuation can be used to control the bilinear form $B_\Subset$. This is the first of the two important corona arguments in the paper. The reader should recall the definition of $f$-stopping intervals in Definition 3.5.

5.1. Remark. The stopping intervals $\mathcal F$ are the standard construct in proving paraproduct style arguments; moreover, the identification and control of paraproducts is an essential part of the two-weight problem. Thus, it is natural to incorporate these intervals into the proof at an early stage. Indeed, if this step is not taken, nuanced Carleson measure estimates are needed.

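5.2. Definition. [The Calderón-Zygmund Corona Decomposition] For $F \in \mathcal F$, we say that the pair of intervals $(I, J) \in \mathcal P_{2,3}$ is in $\mathcal C(F)$ if and only if $\pi_{\mathcal F} J = F$. This definition only depends upon $J$. We set $\mathcal C_0(F)$ to be those pairs $(I, J) \in \mathcal C(F)$ such that $\pi_{\mathcal F} I_J = F$. Note the dependence of this definition on the pair $(I, J)$. And, let $\mathcal C^0(F) = \mathcal C(F) \setminus \mathcal C_0(F)$. Define the associated projections

$P^w_F \phi := \sum_{J : \pi_{\mathcal F} J = F} \Delta^w_J \phi$.

Note that these projections are pairwise $L^2(w)$-orthogonal in $F \in \mathcal F$, and we have

(5.3)  $\sum_{F \in \mathcal F} \|P^w_F \phi\|_w^2 \le \|\phi\|_w^2$.

We use a similar, but distinct, notation $P^\sigma_F f := \sum_{I :\ \pi_{\mathcal F} I_+ = F \text{ or } \pi_{\mathcal F} I_- = F} \Delta^\sigma_I f$. Here, we sum over all $I$ so that a dyadic child of $I$ has $\mathcal F$-parent $F$. These projections are not orthogonal in $F$, but nevertheless satisfy a variant of (5.3) that we will need.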

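The (Calderón-Zygmund) corona decomposition of the bilinear form $B_\Subset$ is then based upon the $f$-stopping intervals, hence non-linear in nature.

$B_\Subset(f, \phi) = \sum_{F \in \mathcal F} \sum_{t=1}^{3} B_t(f, \phi; F)$,

(5.4)  $B_1(f, \phi; F) := \sum_{(I,J) \in \mathcal C^0(F)} E^\sigma_F \Delta^\sigma_I f \cdot \langle H_\sigma(I_F \setminus F), \Delta^w_J \phi \rangle_w$,

$B_2(f, \phi; F) := \sum_{(I,J) \in \mathcal C^0(F)} E^\sigma_F \Delta^\sigma_I f \cdot \langle H_\sigma(F), \Delta^w_J \phi \rangle_w$,

$B_3(f, \phi; F) := \sum_{(I,J) \in \mathcal C_0(F)} E^\sigma_{I_J} \Delta^\sigma_I f \cdot \langle H_\sigma(I_J), \Delta^w_J \phi \rangle_w$.

Let us argue that we have equality above. The term $B_3$ is the only one that depends upon $\mathcal C_0$, and it will be further decomposed below. The remaining two terms depend upon the complementary part $\mathcal C^0$ of the corona. For $(I, J) \in \mathcal C^0$, note that $I_J$ strictly contains $F$, and $\Delta^\sigma_I f$ is constant on $I_J \supset F$, hence $E^\sigma_F \Delta^\sigma_I f = E^\sigma_{I_J} \Delta^\sigma_I f$. And, we have written $I_J = I_F = F + (I_F \setminus F)$ to get the two terms $B_1$ and $B_2$.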

5.1. The Term $B_3$. We will show in §6 that we have the inequality

(5.5)  $|B_3(f, \phi; F)| \lesssim \{\mathcal H + \mathcal V\} \{\gamma(F)\sigma(F)^{1/2} + \|P^\sigma_F f\|_\sigma\} \|P^w_F \phi\|_w$.

An application of Cauchy-Schwarz, and (quasi)-orthogonality will complete the estimate of this term.

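5.2. The Term $B_2$. We claim the estimate

(5.6)  $|B_2(f, \phi; F)| \lesssim \mathcal H \gamma(F) \sigma(F)^{1/2} \|P^w_F \phi\|_w$.

In view of the quasi-orthogonality condition (3.6) and (5.3), a trivial application of Cauchy-Schwarz will complete the estimate of this term. Namely, we have

$\sum_{F \in \mathcal F} |B_2(f, \phi; F)| \lesssim \mathcal H \Bigl[ \sum_{F \in \mathcal F} \gamma(F)^2 \sigma(F) \times \sum_{F \in \mathcal F} \|P^w_F \phi\|_w^2 \Bigr]^{1/2} \lesssim \mathcal H \|f\|_\sigma \|\phi\|_w$.

The proof of (5.6) is quickly obtained. We estimate, using the telescoping property of martingale differences,

$|B_2(f, \phi; F)| \le \Bigl| \sum_{J : \pi_{\mathcal F} J = F} \ \sum_{I : (I,J) \in \mathcal C^0(F)} E^\sigma_F \Delta^\sigma_I f \cdot \langle H_\sigma F, \Delta^w_J \phi \rangle_w \Bigr|$

$\lesssim |E^\sigma_F f| \, \|H_\sigma(F)\|_w \, \|P^w_F \phi\|_w$

$\lesssim \mathcal H \gamma(F) \sigma(F)^{1/2} \|P^w_F \phi\|_w$.

The expression $E^\sigma_F \Delta^\sigma_I f$ arises above since for $(I, J) \in \mathcal C^0(F)$, it holds that $F \subset I_J$. Hence, the sum of martingale differences can be summed exactly as above.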

For $B_3(f, \phi; F)$, the definition of bounded fluctuation in (3.7) was constructed for this term. Namely, the function $(C\gamma(F))^{-1} f \cdot F$ is in $BF_{\mathcal F}(F)$. The function $P^w_F \phi$ is $\mathcal F$-adapted to $F$, whence

$|B_3(f, \phi; F)| \le \mathbf{BF} \{\gamma(F)\sigma(F)^{1/2} + \|P^\sigma_F f\|_\sigma\} \|P^w_F \phi\|_w$.

An application of Cauchy-Schwarz, and appeal to (quasi-)orthogonality will complete the analysis of this term.



5.3. The Term $B_1$. The analysis of the term $B_1$ in (5.4) is based upon the functional energy condition, and leads to this estimate:

(5.7)  $\Bigl| \sum_{F \in \mathcal F} B_1(f, \phi; F) \Bigr| \lesssim \mathbf F \|f\|_\sigma \|\phi\|_w$.

The following lemma records a monotonicity property for the Hilbert transform, and a property 
involving the Poisson integral. 

5.8. Lemma (Monotonicity Property). Suppose that $\nu$ is a signed measure, and $\mu$ is a positive measure with $\mu \ge |\nu|$, both supported outside an interval $I$. Let $J \subset J^* \Subset I$. Then it holds that

(5.9)  $|\langle H\nu, h^w_J \rangle_w| \le \langle H\mu, h^w_J \rangle_w$.

In addition, we have the estimate



(5.10) P(^,r]( h-)^ <(H^i,hf) + 



1 1-e 



III 



The function $H\mu$ will be monotonically decreasing on $J$, and we have chosen the definition of the Haar functions so that $\langle H\mu, h^w_J \rangle_w$ is non-negative, while $\langle x, h^w_J \rangle_w$ is negative.

Proof. This argument is special to the Hilbert transform. Let J- = J n (— oo, c) and J+ = 
J n (c, oo). We may renormalize the Haar function hy so that 



J- 



dw 



dw = 1 . 



Then we have 



Hy [x] (x) dw (x) + 



Hy (xl 



hf (x: 



dw fx) 



[Hy (x) - Hy (x') 



J- 



Hy (x) (x) dw (x) 
Hy (x')|hf (x')|dw(x') 
dwfx'l hTix] dwfx] 



X — X 



R\J 



(y - (ij - X 



h-(x'] 



dw (x'l 



Hf (x: 



dw fx] 



and since > for ij G R \ J and x G J+ and x' G J-, we have 



Hy,h^ 



< 



]- 



X — X 



R\J 



iv -x]{y- X 



(x'l 



dw [x!] 



hf (x: 



dw fxl 



H\i,K 
where the last equality follows from the previous display with $\mu$ in place of $\nu$. This concludes the proof of the first estimate (5.9).

For the second estimate (5.10), we will make a first order Taylor polynomial approximation of 
H|j. on the interval J. Let us denote the derivative by D, and for x G J note that 



DH|j,(x) 



1 



1 



|x(dx] 



The point here is that the second derivative is somewhat small. From this, we can write, letting 
Cj be the center of J, 

HiJ.(x] - Hm.(cj] - (x - Cj]H[j.(cj] < sup(x - Cj)^|D^H|j.(x]| 



< 



-P(^,J], XGJ. 



dist(9J,I) 

When we are estimating the inner product with a Haar function, constants are immaterial, 
therefore, using the fact that J is good, we can write 

hl-e 



(H^,hp^-DH^(cj)(x,li]r)^ 

Finally, one uses DH|j.(cj) < | J|^^P((j., J). This finishes the argument. 
Returning to the analysis of Bi, write 



< 



□ 



Note that If^l < f. Let i7*(F) be the maximal (e,r)-good intervals J* e F. Applying the Lemma, 
we have 



|Bi(f,(t);F)| < Y_ ^ l(HafF,Af4)), 



^ Z Z p(fcT(R-F),r; 



7t^J=F 



J*eJ'(F) JcJ* 

7T^J=F 



IJ* 



where $\tilde\phi$ is obtained from $\phi$ by an appropriate $w$-Haar multiplier, chosen to make all the inner products above non-negative so that the absolute values can be removed. By orthogonality of the projections $P^w_F$, and the definition of the functional energy condition, we see that the sum over $F \in \mathcal F$ of this last expression verifies (5.7).

To be specific, an operator $T$ is a $w$-Haar multiplier if it is of the form $T\phi = \sum_{J \in \mathcal D} \epsilon_J \Delta^w_J \phi$, with $|\epsilon_J| = 1$. These operators are isometries on $L^2(w)$. The multiplier we need has $\epsilon_J = \operatorname{sgn}(\langle \phi, h^w_J \rangle_w)$, and $\tilde\phi = T\phi$.
6. Bounded Fluctuation and the Second Corona

There are two estimates of the bounded fluctuation constant BF that should be made, the 
easy estimate of BF < Bg, and the difficult estimate of BF < y/Aj+W. We turn to the second 
estimate, which is involved. 

Fix the data for the bounded fluctuation term. $F$ is an interval, and $\operatorname{Child}(F)$ are the intervals inside $F$; the function $f$ is of bounded fluctuation relative to this data, and $\phi$ is adapted to $\{F\} \cup \operatorname{Child}(F)$. We consider the difficult estimate, in which the Dini and testing conditions dominate the bounded fluctuation term. Setting notation, we are to show (5.5), which is the same as this estimate.

$B_{\mathrm{stop}}(f, \phi) := \sum_{I : \pi_{\mathcal F} I = F} \ \sum_{J : J \Subset I} E^\sigma_{I_J} \Delta^\sigma_I f \cdot \langle H_\sigma I_J, \Delta^w_J \phi \rangle_w$,

(6.1)  $|B_{\mathrm{stop}}(f, \phi)| \lesssim \{\mathcal H + \mathcal V\} \{\gamma(F)\sigma(F)^{1/2} + \|f\|_\sigma\} \|\phi\|_w$.

The origins of this argument are in [7], as modified in [5]; these papers refer to this term as the stopping term. We will again find simplifications by the use of the Calderón-Zygmund corona. We define here the Dini corona.

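6.2. Definition. Let $I_0 \subset F$. We set $\mathcal S(I_0)$ to be the maximal subintervals $S \subset I_0$ such that

(6.3)  $\Psi_w(I_0, S)^2 \ge 4 \mathcal V^2 \sigma(S)$,

$\Psi_w(I_0, S)^2 := \sup \psi(s)^{-1} \sum_{j,k \ge 1} P(\sigma \cdot (I_0 - I_j), I_{j,k})^2 E(w, I_{j,k})^2 w(I_{j,k})$,

where the supremum is formed over the various data that enter into Definition 1.7, to wit:

• $\{I_j : j \ge 1\}$ is a sub-partition of $S$ into intervals;

• $\{I_{j,k} : k \ge 1\}$ is a sub-partition of $I_j$ into good intervals;

• $s \ge r$ is an integer and $|I_{j,k}| \le 2^{-s}|I_j|$ for all $j, k$.

We then set $\mathcal S := \bigcup_{s \ge 0} \mathcal S_s$, where $\mathcal S_0 = \{F\}$, and inductively, $\mathcal S_{s+1} = \bigcup_{S \in \mathcal S_s} \mathcal S(S)$.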

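It is important to note that despite the assumption of the Dini energy condition, there is no a priori upper bound of the quantity $\Psi_w(I_0, S)$ in terms of $\sigma(S)$. We also have the following elementary, but critical, estimate:

(6.4)  $\sum_{S \in \mathcal S(I_0)} \sigma(S) \le \tfrac14 \sigma(I_0)$.

We have by (1.8) and (6.3),

$4 \mathcal V^2 \sum_{S \in \mathcal S(I_0)} \sigma(S) \le \sum_{S \in \mathcal S(I_0)} \Psi_w(I_0, S)^2 \le \mathcal V^2 \sigma(I_0)$.

The constant $\mathcal V^2$ divides out, so that (6.4) holds.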

The Dini corona decomposition of $\mathcal C_0(F)$ is then the collection of pairs $\mathcal B(S)$, of those $(I, J) \in \mathcal C_0(F)$ such that $J$ has $\mathcal S$-parent $S$. We further write $\mathcal B(S)$ as the disjoint union $\mathcal B_0(S) \cup \mathcal B^0(S)$, where $\mathcal B_0(S)$ consists of those pairs $(I, J) \in \mathcal B(S)$ with $I_J \subset S$. This definition is carefully crafted so that (1.8) fails for $I_J$ if $(I, J)$ is in $\mathcal B_0(S)$.

We then split the term $B_{\mathrm{stop}}(f, \phi)$ up according to the corona. The argument of the Hilbert transform is also split up. Here, it is a basic fact that for each $J$, the function

(6.5) bj:= Y. ^^i^-h 

I: (IJ)eBo(F)UB°(F) 

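is supported on $F$, and has $L^\infty$ norm dominated by 2. The Hilbert transform is applied to $I_J$. Let $S$ be the $\mathcal S$-parent of $J$. We will write this as

$I_J = S - (S - I_J)$ if $I_J \subset S$, equivalently, $(I, J) \in \mathcal B_0(S)$;
$I_J = (I_J - S) + S$ if $I_J \supsetneq S$, equivalently, $(I, J) \in \mathcal B^0(S)$.

And this permits us to write

$B_{\mathrm{stop}}(f, \phi) = \sum_{S \in \mathcal S} B_1(f, \phi; S) + B_2(f, \phi; S) - B_3(f, \phi; S)$,

(6.6)  $B_1(f, \phi; S) := \sum_{(I,J) \in \mathcal B(S)} E^\sigma_{I_J} \Delta^\sigma_I f \cdot \langle H_\sigma(S), \Delta^w_J \phi \rangle_w$,

$B_2(f, \phi; S) := \sum_{(I,J) \in \mathcal B^0(S)} E^\sigma_{I_J} \Delta^\sigma_I f \cdot \langle H_\sigma(I_J - S), \Delta^w_J \phi \rangle_w$,

$B_3(f, \phi; S) := \sum_{(I,J) \in \mathcal B_0(S)} E^\sigma_{I_J} \Delta^\sigma_I f \cdot \langle H_\sigma(S - I_J), \Delta^w_J \phi \rangle_w$.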

6.1. The control of $B_3$. We take up the most delicate case of $B_3(f, \phi; S)$, showing that

(6.7) $|B_3(f, \phi; S)| \lesssim \mathcal V \, \|P^\sigma_S f\|_\sigma \|P^w_S \phi\|_w$, $\quad S \in \mathcal S$.

Here, the projection $P^\sigma_S f$ is onto the span of the Haar functions $h^\sigma_I$ such that a child of $I$ has $\mathcal S$-parent $S$, and $P^w_S \phi$ has an analogous definition. Note that the projections $P^w_S$ are pairwise orthogonal,
while a given Haar function can contribute to at most two projections $P^\sigma_S$. This and an application
of Cauchy-Schwarz will show that

$$\sum_{S \in \mathcal S} |B_3(f, \phi; S)| \lesssim \mathcal V \, \|f\|_\sigma \|\phi\|_w,$$

which is as required in (6.1).
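
The summation over $S$ can be written out explicitly. The following is a sketch under the two facts just stated (pairwise orthogonality of the $P^w_S$, and each Haar function contributing to at most two of the $P^\sigma_S$); the constant is written $\mathcal V$ on the assumption that this is the constant appearing in (6.7).

```latex
\begin{align*}
\sum_{S \in \mathcal S} |B_3(f,\phi;S)|
  &\lesssim \mathcal V \sum_{S \in \mathcal S}
     \|P^\sigma_S f\|_\sigma \, \|P^w_S \phi\|_w
     && \text{by (6.7)} \\
  &\le \mathcal V
     \Bigl[ \sum_{S \in \mathcal S} \|P^\sigma_S f\|_\sigma^2 \Bigr]^{1/2}
     \Bigl[ \sum_{S \in \mathcal S} \|P^w_S \phi\|_w^2 \Bigr]^{1/2}
     && \text{Cauchy-Schwarz in } S \\
  &\le \mathcal V \cdot \sqrt{2}\, \|f\|_\sigma \cdot \|\phi\|_w .
\end{align*}
% Last line: each Haar coefficient of f is counted at most twice, so
%   \sum_S \|P^\sigma_S f\|_\sigma^2 \le 2 \|f\|_\sigma^2,
% and the P^w_S are pairwise orthogonal, so
%   \sum_S \|P^w_S \phi\|_w^2 \le \|\phi\|_w^2.
```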

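In the main estimate, we hold the relative lengths of $I$ and $J$ constant. It holds that

$$|B_{3,s}(f, \phi; S)| := \Bigl| \sum_{\substack{(I,J) \in B_o(S) \\ 2^s|J| = |I|}} \mathbb E^\sigma_{I_J} \Delta^\sigma_I f \cdot \langle H_\sigma(S - I_J), \Delta^w_J \phi \rangle_w \Bigr| \lesssim \mathcal V \, \psi(s) \, \|P^\sigma_S f\|_\sigma \|P^w_S \phi\|_w, \quad s > r.$$

The constants $\mathcal V$ and $\psi(s)$ are as in Definition 1.7, and in particular, $\sum_s \psi(s) \le 1$. This is
summed over $s$ to finish the proof of (6.7).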
To prove the inequality above, we use this observation. For any choice of sign,

This is the elementary property of the Haar functions of (2.1). We apply Cauchy-Schwarz in $I$
to the expression below



I:7t5l=S 



Y_ |(H,(S-Ij],A-(t)), 



■J: 



. [IJ]eBi(S) 



^l|Psfi|lM|Ps'4)||^xM 



sup supa(Ie)-^ Y. (Ha(S - I9), Hf )^ 



1 : 7:51=5 ee{±} 



Here, to get the bound in terms of $\|P^w_S \phi\|_w$, we use the fact that for fixed $J \subset S$, there is a
unique $I$ so that $J \subset I$, $2^s|J| = |I|$, and $(I, J) \in B_o(S)$. We turn our attention to $M$. Applying (5.9),
and the definition of the functional in Definition 1.7, we have

cT(Ie)-^ Y_ (H,(S-Ie),Hf)^<-y^^Tl;(s)^cT(Ie)-^^^(F,Ie) <4il)(s)^-4'^ 

J: 2^IJI=|I1 
Il=Ie 

In the last inequality, it is decisive that the interval $I_J \subset S$, hence fails the condition (6.3).

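6.2. The Control of $B_2$. For $S \in \mathcal S$, let $P^w_S$ be the projection onto the span of Haar functions
$h^w_J$ with $\pi_{\mathcal S} J = S$. By Lemma 5.8, there is a function $\tilde\phi$, a $w$-Haar multiplier of $\phi$, so that

$$|B_2(f, \phi; S)| \lesssim \langle H_\sigma(F - S), P^w_S \tilde\phi \rangle_w = \langle H_\sigma F, P^w_S \tilde\phi \rangle_w - \langle H_\sigma S, P^w_S \tilde\phi \rangle_w .$$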
Now for the first term on the right above, 

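(6.8) $\sum_{S \in \mathcal S} \langle H_\sigma F, P^w_S \tilde\phi \rangle_w = \bigl\langle H_\sigma F, \sum_{S \in \mathcal S} P^w_S \tilde\phi \bigr\rangle_w \le \mathcal H \, \sigma(F)^{1/2} \|P^w_F \phi\|_w$.

And, for the second term on the right above,

$$\sum_{S \in \mathcal S} |\langle H_\sigma S, P^w_S \tilde\phi \rangle_w| \le \mathcal H \sum_{S \in \mathcal S} \sigma(S)^{1/2} \|P^w_S \phi\|_w \le \mathcal H \Bigl[ \sum_{S \in \mathcal S} \sigma(S) \times \sum_{S \in \mathcal S} \|P^w_S \phi\|_w^2 \Bigr]^{1/2} \lesssim \mathcal H \, \sigma(F)^{1/2} \|P^w_F \phi\|_w .$$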
Here, we have appealed to the critical estimate (6.4). This with (6.8) completes the bound of $B_2$.
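
The appeal to (6.4) amounts to a geometric series over the generations of stopping intervals. The sketch below writes $c < 1$ for the constant in (6.4); the name $c$ is a placeholder introduced here, and the only property used is that it is strictly less than one. It also assumes that (6.4) may be applied with each stopping interval $S'$ in the role of $I_0$, and that the generations $\mathcal S_s$ are those of the construction $\mathcal S_0 = \{F\}$, $\mathcal S_{s+1} = \bigcup_{S' \in \mathcal S_s} \mathcal S(S')$.

```latex
\begin{align*}
\sum_{S \in \mathcal S_{s+1}} \sigma(S)
  &= \sum_{S' \in \mathcal S_s} \ \sum_{S \in \mathcal S(S')} \sigma(S)
   \le c \sum_{S' \in \mathcal S_s} \sigma(S')
   && \text{by (6.4) applied to each } S' , \\
\sum_{S \in \mathcal S_s} \sigma(S)
  &\le c^{\,s} \, \sigma(F)
   && \text{by induction, since } \mathcal S_0 = \{F\} , \\
\sum_{S \in \mathcal S} \sigma(S)
  &= \sum_{s \ge 0} \ \sum_{S \in \mathcal S_s} \sigma(S)
   \le \frac{1}{1 - c} \, \sigma(F) .
\end{align*}
```

This is the bound $\sum_{S \in \mathcal S} \sigma(S) \lesssim \sigma(F)$ used after the Cauchy-Schwarz step.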

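6.3. The Control of $B_1$. The bound for $B_1$, as defined in (6.6), is straightforward. Recalling our
observation that the functions $b_J$ in (6.5) are bounded in $L^\infty$ by $\gamma(F)$, one can appeal directly to
the testing condition to see that

$$|B_1(f, \phi; S)| \le \mathcal H \cdot M \cdot \sigma(S)^{1/2} \|P^w_S \phi\|_w, \qquad M := \sup_J \Bigl| \sum_{I \,:\, (I,J) \in B(S)} \mathbb E^\sigma_{I_J} \Delta^\sigma_I f \Bigr| .$$

But $M \le \gamma(F)$.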

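Using the orthogonality of the projections $P^w_S$, and the condition on the stopping intervals (6.4),
one sees that

$$\sum_{S \in \mathcal S} |B_1(f, \phi; S)| \le \mathcal H \, \gamma(F) \Bigl[ \sum_{S \in \mathcal S} \sigma(S) \times \sum_{S \in \mathcal S} \|P^w_S \phi\|_w^2 \Bigr]^{1/2} \lesssim \mathcal H \, \gamma(F) \, \sigma(F)^{1/2} \|P^w_F \phi\|_w .$$

This is as required by (6.1).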



6.4. A Second Estimate. We have completed the proof of (6.1), and turn to the easy estimate 
BF < B<5 + H. Indeed, if we are given a function f and <p with which we are to test the bounded 
fluctuation condition, note that the sum that appears in (3.7) reduces to $B_{\mathrm{stop}}(f, \phi)$. But, we
have 



Bstop(f,4))-B«(f,(t)) 



< 



< 



Y_ ^EpArf(H,(Ij),g) 



I : I3F J : JeF 



Ff(H.F,g) 



+ 



I:I3F 



•pA^f(H,(lF-F),g) 



The first term is bounded by $\mathcal H \, \sigma(F)^{1/2} \|g\|_w$. The second term is zero, since $f$ is supported on $F$,
hence $\mathbb E^\sigma_{I_F} \Delta^\sigma_I f \cdot (I_F - F) = 0$.

We argue that H < + W + Eg, which completes our proof of (3.11), that is BF < 

yA^ + W + Bg. 

Let us fix an interval $I^0$, and a function $\phi \in L^2(w)$ supported on $I^0$, for which we are to estimate
$\langle H_\sigma I^0, \phi \rangle_w$ in terms of the $A_2$ constant, the weak-boundedness constant, and the split form
constant $B_{\Subset}$. By appealing to the weak-boundedness constant $\mathcal W$, we can assume that $\phi$ has
$w$-integral zero. We are also free to consider (random) dyadic grids $\mathcal D$, with respect to which $I^0$
is dyadic. It follows that we can take $\phi$ in the linear span of $\{h^w_J : J \subset I^0,\ J \text{ is good}\}$.

By appealing to Theorem 3.2, it suffices to consider the sum of the two forms $B_{\Subset}(I^0, \phi) + B_{\Supset}(I^0, \phi)$. But $B_{\Supset}(I^0, \phi)$ is zero: if $\Delta^w_J \phi \ne 0$, then $J \subset I^0$, and so for any $I \subset J$, we have
$\Delta^\sigma_I I^0 = 0$. The form $B_{\Subset}(I^0, \phi)$ is controlled by the constant $B_{\Subset}$. So our argument is complete.
7. Dominating Functional Energy



We have two estimates of the functional energy constant to prove, that F < H + W and 
F < H+Bg. The data for these considerations are a non-negative function f , its stopping intervals 
J^, and a sequence of functions {g^ : F G J-'}, with gp J^-adapted to F. We assume that H and 
Bg are finite, and consider the expression Bg(f, gp). By the monotonicity property Lemma 5.8, 
it suffices to assume that f takes the value y(F) on each set of the form F — IJ Childjr(F). The 
point to observe is that if we write 

fF = f-(F-UChild^(F))+ Y. Ep'f-P', 

F'eChild^(F) 

we can appeal to the bounded fluctuation property to write 

Bg(f ■ F, gp)| < BF||fp||,||gp|U < { + Bg}||fF||a|| g^lU • 
Applying Cauchy-Schwarz to the last two products of norms, we get



< 



< 



\2 
iFlla 



FeJ^ 



1/2 



1/2 



< 



where we write $g = \sum_{F \in \mathcal F} g_F$. Note that the condition that the functions $g_F$ be $\mathcal F$-adapted implies
that they are orthogonal in $F \in \mathcal F$.

We conclude that on the assumption that ^/A2 and Bg are finite, we have 

}^|Bg(f - (f ■ F), gp)| < { + Bg}||f ||,||g|U • 
FeJ' 

With the assumptions on the functions $g_F$ in the definition of functional energy, the inequalities (5.10) are at our disposal. They must be applied in a way that controls the right-hand side of
that inequality.

Let $\tilde g_F$ be a Haar multiplier of $g_F$, chosen so that $\langle x, \Delta^w_J \tilde g_F \rangle_w \ge 0$ for all $J$. We have from (5.10) the
estimate



(7.1) p((f-(f-n)cT,r 

oo 

D(f,gF) :=}ly(7r^n Y. 

t=1 J : JaF 

7rpJ=F 



X 



gF) <Bg(f-(f ■F),gp) + D(f,gp] 



iTt^Fl 



1-e 



PlaTt^FJ) v'w(J)|(gp,h-), 



Note that we are appealing to the specific form of $f$ to obtain the form for $D(f, g_F)$. If the first
term on the right in (7.1) is the larger, we are finished with the proof. Otherwise, we will use the
$A_2$ constant to control the terms $D(f, g_F)$.
Note that by repeated application of Cauchy-Schwarz in different variables, it holds that

^ r ITI nl-e , ^ 



t=l 



J:JeF 



J:J<aF 

7tFj=F 



1-e 



t=i 

oo 

^\\9f\\i^yin':F^)'t'Y. 

t=l J : Jol 

TTpH 

oo 



P(cT7r>F,J)Jw(J) 



■|F|- 


l+e 


[ IJI 1 






[\n^f\\ 



2-2e 



'p(a7t>F,J]^w(J) 



t=i 



J:J<sF 

7tFj=F 



iTt^Fl 



l-3e 



P(o-7r^F,J) 



(7.2) 



t=i 



This holds if 1 — 3e > ^, which we can assume is true, as this choice of e is only associated with 
the definition of being good. For any $0 < \epsilon < 1$, we can make a choice of integer $r$ so that it
suffices to consider only $(\epsilon, r)$-good intervals. From the (quasi-)orthogonality conditions on $f$ and
$\{g_F\}$, it is then easy to see that



This completes our proof of F < ^/A2 + Bg. Indeed, to verify the last inequality, let us write 
$g = \sum_{F \in \mathcal F} g_F$, and apply (7.2).



^D[f, gp) < A2_^||gF| 



^ \/A^llg||w 



1/2 



£2-^/V(7tVF)V(7rVF) 
t=i 

oo 

^^2-^/V(^^n'cT(7r^F) 



feT t=i 



1/2 



2 y w 



^yF]^a(F] 



1/2 



< a/a 



2 ' k y w 



It remains to prove F < V, which follows from an elementary application of the side condition.
We reorganize the sum around the $\mathcal F$-ancestors of an interval $F \in \mathcal F$.
(7.3)



< 



t=2 



*I|2 
w 



1/2 



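Here, we have appealed to (1.8) with the data $I_0 \leftarrow F$, $\{I_j\} \leftarrow \{F' \in \mathcal F : \pi^t_{\mathcal F} F' = F\}$, and
$\{I_{j,k}\} \leftarrow \bigcup \{\mathcal J^*(F') : F' \in \mathcal F,\ \pi^t_{\mathcal F} F' = F\}$. It follows that the $\{I_{j,k}\}$ are $(\epsilon, r)$-good, and
$|I_{j,k}| \le 2^{-t}|I_j|$, and therefore, we have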



Y_ P(cT(F-F'),r)' Y. E(w,r)^w(r)<W^Tl;(t)cT(F) 



The inequality (7.3) then follows from Lemma 5.8, and the assumption that $\mathbb E^w_{J^*} g_F = 0$ for all
$F \in \mathcal F$ and $J^* \in \mathcal J^*(F)$. In particular, we have



— ,gpr) <E(w,r)w(r)i/2||gF-r 



Now, we have the quasi-orthogonality estimate (3.6). Using the condition Y.t>2'^^^^ = 1, we 
have 



t=2 



.F'eJ-:7r^=FJ*eJ*(F') 



*l|2 
w 



1/2- 



<XL^w }l Y. ii9p-rii^< iiofIi^ 

FeJ^ t=2 F'6J=":7T^=F J*G:7*(F') 



8. The Remaining Estimates 



8.1. The Term $B_{1,2}$. The term $B_{1,2}$ is defined in (4.1); we are proving the estimate (4.2), where
the constants on the right are the $A_2$ constant and the weak-boundedness constant in (3.1).
Note that by the definition of the weak-boundedness constant, we have



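$$\Bigl| \sum_{\substack{(I,J) \,:\, 3I \cap 3J \ne \emptyset \\ 2^{-r}|I| \le |J| \le 2^{r}|I|}} \langle H_\sigma \Delta^\sigma_I f, \Delta^w_J \phi \rangle_w \Bigr| \le \mathcal W \sum_{\substack{(I,J) \,:\, 3I \cap 3J \ne \emptyset \\ 2^{-r}|I| \le |J| \le 2^{r}|I|}} |\langle f, h^\sigma_I \rangle_\sigma \, \langle \phi, h^w_J \rangle_w| \lesssim \mathcal W \, \|f\|_\sigma \|\phi\|_w .$$

If $3I \cap 3J = \emptyset$ and $|J| \le |I|$, we have the trivial consequence of (5.9),

(8.1) $|\langle H_\sigma(I), h^w_J \rangle_w| \lesssim \sigma(I) \dfrac{\sqrt{w(J)} \, |J|}{(|J| + \operatorname{dist}(I, J))^2}$.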
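Now, the estimate (8.1) implies that the remaining part of $B_{1,2}$ is controlled by

$$\sum_{\substack{(I,J) \,:\, 3I \cap 3J = \emptyset \\ 2^{-r}|I| \le |J| \le 2^{r}|I|}} |\langle H_\sigma \Delta^\sigma_I f, \Delta^w_J \phi \rangle_w| \lesssim \sum_{\substack{(I,J) \,:\, 3I \cap 3J = \emptyset \\ 2^{-r}|I| \le |J| \le 2^{r}|I|}} |\langle f, h^\sigma_I \rangle_\sigma \, \langle \phi, h^w_J \rangle_w| \, \alpha(I, J), \qquad \alpha(I, J) := \frac{\sigma(I)^{1/2} \, w(J)^{1/2} \, |J|}{(|J| + \operatorname{dist}(I, J))^2} .$$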



The last coefficients satisfy the assumptions of Schur's test, with the relevant constant controlled
by the $A_2$ constant. Namely, for any $I$ we have by the Cauchy-Schwarz inequality,



(8.2) 



.r 3In3J=0 
■2"''|I|<IJI<2'^|I| 



< 



(IJ) : n-r 



3in3j=0 



2"''|I|<IJI<2''| 



X 



dist(I,J])2 



(IJ) : 



3In3J=0 
2-''|I|<|J|<2''|I| 



dist(IJ))2 



This completes the proof of (4.2). 
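
For reference, Schur's test in the unweighted form for nonnegative coefficients reads as follows. This is a standard formulation offered as a reminder, and not necessarily the exact variant applied above; the names $K_1$, $K_2$, $a_I$, $b_J$ are introduced here for illustration only. In the present setting one would take $a_I = \langle f, h^\sigma_I \rangle_\sigma$ and $b_J = \langle \phi, h^w_J \rangle_w$, so that $\|a\|_{\ell^2} \le \|f\|_\sigma$ and $\|b\|_{\ell^2} \le \|\phi\|_w$.

```latex
% Hypotheses: \alpha(I,J) \ge 0, with bounded row and column sums.
\[
  \sup_{I} \sum_{J} \alpha(I,J) \le K_1 ,
  \qquad
  \sup_{J} \sum_{I} \alpha(I,J) \le K_2 .
\]
% Conclusion: the bilinear form is bounded on \ell^2 \times \ell^2.
\[
  \Bigl| \sum_{I,J} a_I \, b_J \, \alpha(I,J) \Bigr|
  \le \sqrt{K_1 K_2} \; \|a\|_{\ell^2} \, \|b\|_{\ell^2} .
\]
% Proof: split \alpha = \alpha^{1/2} \cdot \alpha^{1/2} and apply Cauchy-Schwarz
% to the double sum over (I,J).
\begin{align*}
  \sum_{I,J} |a_I| \, |b_J| \, \alpha(I,J)
  &\le \Bigl[ \sum_{I} |a_I|^2 \sum_{J} \alpha(I,J) \Bigr]^{1/2}
       \Bigl[ \sum_{J} |b_J|^2 \sum_{I} \alpha(I,J) \Bigr]^{1/2} \\
  &\le \sqrt{K_1 K_2} \; \|a\|_{\ell^2} \, \|b\|_{\ell^2} .
\end{align*}
```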



8.2. The Term $B_{2,1}$. For the term $B_{2,1}$ of (4.3), we prove (4.5). In this sum, we have $2^r|J| < |I|$,
and $J \cap 3I = \emptyset$. We further restrict the length of $J$ to be $2^{-s}|I|$, for $s > r$. Using the estimate
(8.1), we can apply the Schur test to see that

Y_ (H,Arf,Af4))^ < 2-^/2 A2||f||a||(t)||w. 

, , . 3in3j=0 
U.JJ ■ 2-''|I|<|J|<2''|I| 

Indeed, the only point to observe is that for the analog of the first term on the right in (8.2),
we have the geometric decay claimed above.



8.3. The Term $B_{2,2}$. For the term $B_{2,2}$ of (4.4), we prove the second half of (4.5). The distinction
between this case and the previous is that this is the 'local' but not 'inside' part. For integers
$s > 1$, we prove

Y_ (HaArf,Af(t))^<2-/2A2||f||.||ct)|U. 

I T.21J|=|I| 
JC3I\I 

But the essential points are on the one hand that we have 

$$\sum_{I} \ \sum_{\substack{J \,:\, 2^s|J| = |I| \\ J \subset 3I \setminus I}} \langle \phi, h^w_J \rangle_w^2 \lesssim \|\phi\|_w^2 ,$$
since the length and location of J is specified by I. And on the other hand, that we have the
estimate 



' ■ JC3I\I 

To be concrete, let $\theta, \theta' \in \{\pm\}$, and consider the sum



Y_ (HaIe,H]r) 



21Jh|I| 

Jci+(e'|i|) 



Now, 



li hi < CT(Ie) which is the estimate (2.1). And, we can apply (5.9) to see that 



S(I) 



J: 



J: 



2'IJI=|I| 



Jci+{e'|i|) ' ■ jci+(e'|i|) 

We ignore the contribution from energy. But, the intervals $J$ are good. This means that
$\operatorname{dist}(J, I_\theta) \ge |I_\theta|^{1-\epsilon} |J|^{\epsilon}$, which fact we use to estimate the Poisson integral above as follows.

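8.3. Lemma. Let $J \subset I \subset I'$, $|J| = 2^{-s}|I|$, with $s > r$ and $J$ good. Then

$$P(\sigma \cdot (I' - I), J) \lesssim 2^{-(1-2\epsilon)s} \, P(\sigma \cdot (I' - I), I) .$$

Proof. Note that for $x \in I' - I$ we have

$$\operatorname{dist}(x, J) \ge |I|^{1-\epsilon} |J|^{\epsilon} = 2^{s(1-\epsilon)} |J| .$$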

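Using this in the definition of the Poisson integral, we get

$$P(\sigma \cdot (I' - I), J) \lesssim \int_{I' - I} \frac{|J|}{\operatorname{dist}(x, J)^2} \, \sigma(dx) \lesssim 2^{-(1-2\epsilon)s} \int_{I' - I} \frac{|I|}{(|I| + \operatorname{dist}(x, I))^2} \, \sigma(dx) = 2^{-(1-2\epsilon)s} \, P(\sigma \cdot (I' - I), I) .$$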



□ 
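
The exponent $(1 - 2\epsilon)s$ can be checked pointwise. The sketch below assumes the Poisson integral has the form $P(\mu, K) = \int \frac{|K|}{(|K| + \operatorname{dist}(x, K))^2} \, \mu(dx)$, which matches the coefficients $\alpha(I, J)$ of Section 8.1 but is an assumption about the normalization. The abbreviations $d_J = \operatorname{dist}(x, J)$ and $d_I = \operatorname{dist}(x, I)$ are introduced here for brevity.

```latex
% Setting: J \subset I, |I| = 2^{s}|J|, x \in I' - I, so that d_I \le d_J,
% and goodness gives d_J \ge 2^{s(1-\epsilon)} |J|.
\begin{align*}
  \frac{|J|}{(|J| + d_J)^2}
  &= 2^{-s} \, \frac{|I|}{(|I| + d_I)^2}
     \Bigl( \frac{|I| + d_I}{|J| + d_J} \Bigr)^{2} , \\
  \frac{|I| + d_I}{|J| + d_J}
  &\le \frac{|I| + d_J}{d_J}
   = 1 + \frac{2^{s}|J|}{d_J}
   \le 1 + 2^{\epsilon s}
   \le 2 \cdot 2^{\epsilon s} , \\
  \frac{|J|}{(|J| + d_J)^2}
  &\le 4 \cdot 2^{-(1 - 2\epsilon)s} \, \frac{|I|}{(|I| + d_I)^2} .
\end{align*}
% Integrating the last line against \sigma over I' - I gives
%   P(\sigma (I' - I), J) \le 4 \cdot 2^{-(1-2\epsilon)s} P(\sigma (I' - I), I).
```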



Applying the Lemma, we have the estimate 

S(I) < 2-^'^-^^'T(cT ■ le, Ie)w(I + e'|I|) < A^cT(Ie) . 

8.4. The Term $B_{3,1}$. For the term $B_{3,1}$ of (4.6), we prove (4.7). This is a simple variant of the
previous estimate. Namely, we have for $\theta \in \{\pm\}$,

$$\sum_{J \subset I,\ I_J = I_\theta} \langle H_\sigma I_{-\theta}, h^w_J \rangle_w^2 \lesssim A_2 \, \sigma(I_{-\theta}) .$$

Here, by $I_{-\theta}$ we mean the child of $I$ complementary to $I_\theta$. We omit the details of the argument.